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MULTIPLEX GENOTYPING USING SOLID PHASE CAP TITP&rt. P 

D I D EOXYNUCLE OTI DE S AND MASS SPECTROMETRY " 

This application is a continuation-in-part and claims 
priority of U.S. Serial No. 10/194,882, filed July 12, 
2002, the contents of which are hereby incorporated by 
reference into this application. 

Background Of The Invention 

Throughout this application, various publications are 
referenced in parentheses. Citations for these 

references may be found at the end of the specification 
immediately preceding the claims. The disclosures of 
these publications in their entireties are hereby 
incorporated by reference into this application to more 
fully describe the state of the art to which this 
invention pertains. 

Single nucleotide polymorphisms (SNPs), the most common 
genetic variations in the human genome, are important 
markers for identifying disease genes and for 
pharmacogenetic studies (1, 2) . SNPs appear in the human 
genome with an average density of once every 1000-base 
pairs (3). To perform large-scale SNP genotyping, a 
rapid, precise and cost-effective method is required. 
Matrix-assisted laser desorption/ionization time-of- 
f light mass spectrometry (MALDI-TOF MS) (4) allows rapid 
and accurate sample measurements (5-7) and has been used 
in a variety of SNP detection methods including 
hybridization (8-10), invasive cleavage (11, 12) and 
single base extension (SBE) (5, 13-17). SBE is widely 
used for multiplex SNP analysis. In this method, primers 
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designed to anneal immediately adjacent to a polymorphic 
site are extended by a single dideoxynucleotide that is 
complementary to the nucleotide at the variable site. By 
measuring the mass of the resulting extension product, a 
particular SNP can be identified. Current SBE methods to 
perform multiplex SNP analysis using MS require 
unambiguous simultaneous detection of a library of 
primers and their extension products. However, 
limitations in resolution and sensitivity of MALDI-TOF MS 
for longer DNA molecules make it difficult to 
simultaneously measure DNA fragments over a large mass 
range (6). The requirement to measure both primers and 
their extension products in this range limits the scope 
of multiplexing. 

A high fidelity DNA sequencing method has been developed 
which uses solid phase capturable biotinylated 
dideoxynucleotides (biotin-ddNTPs ) by detection with 
fluorescence (18) or mass spectrometry (19), eliminating 
false terminations and excess primers. Combinatorial 
fluorescence energy transfer tags and biotin-ddNTPs have 
also been used to detect SNPs (20) . 

False stops or terminations occur when a deoxynucleotide 
rather than a dideoxynucleotide terminates a se+quencing 
fragment. It has been shown that false stops and primers 
which have dimerized can produce peaks in the mass 
spectra that can mask the actual results preventing 
accurate base identification (21) . 

The present application discloses an approach using solid 
phase capturable biotin-ddNTPs in SBE for multiplex 
genotyping by MALDI-TOF MS. In this method primers that 
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have different molecular weights and that are specific to 
the polymorphic sites in the DNA template are extended 
with biotin-ddNTPs by DNA polymerase to generate 3'- 
biotinylated DNA extension products. The 3 ■ -biotinylated 
DNAs are then captured by streptavidin-coated magnetic 
beads, while the unextended primers and other components 
in the reaction are washed away. The pure DNA extension 
products are subsequently released from the magnetic 
beads, for example by denaturing the biotin-streptavidin 
interaction with formamide, and analyzed with MALDI-TOF 
MS. The nucleotide at the polymorphic site is identified 
by analyzing the mass difference between the primer 
extension product and an internal mass standard added to 
the purified DNA products. Since the primer extension 
products are isolated prior to MS analysis, the resulting 
mass spectrum is free of non-extended primer peaks and 
their associated dimers, which increases the accuracy and 
scope of multiplexing in SNP analysis. The solid phase 
purification' system also facilitates desalting of the 
captured oligonucleotides. Desalting is critical in 
sample preparation for MALDI-TOF MS measurement since 
alkaline and alkaline earth salts can form adducts with 
DNA fragments that interfere with accurate peak detection 
(21) . 
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This invention is directed to a method for determining 
the identity of a nucleotide present at a predetermined 
site in a DNA whose sequence immediately 3' of such 
predetermined site is known which comprises: 

(a) treating the DNA with an oligonucleotide primer 
whose sequence is 'complementary to such known 
sequence so that the oligonucleotide primer 
hybridizes to the DNA and forms a complex in 
which the 3' end of the oligonucleotide primer 
is located immediately adjacent to the 
predetermined site in the DNA; 
(b) simultaneously contacting the complex from step 
(a) with four different labeled 

dideoxynucleotides, in the presence of a 
polymerase under conditions permitting a 
labeled dideoxynucleotide to be added to the 3' 
end of the primer so as to generate a labeled 
single base extended primer, wherein each of 
the four different labeled dideoxynucleotides 
(i) is complementary to one of the four 
nucleotides present in the DNA and (ii) has a 
molecular weight which can be distinguished 
from the molecular weight of the other three 
labeled dideoxynucleotides using maS s 
spectrometry; and 
(c) determining the difference in molecular weight 
between the labeled single base extended primer 
and the oligonucleotide primer so as to 
identify the dideoxynucleotide incorporated 
into the single base extended primer and 
thereby determine the identity of the 
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nucleotide present- at the predetermined site in 
the DNA. 



In one embodiment, the method further comprises after 
step (b) the steps of: 

(i) contacting the labeled single base extended 
primer with a surface coated with a compound 
that specifically interacts with a chemical 
moiety attached to the dideoxynucleotide by a 
linker so as to thereby capture the extended 
primer on the surface; and 

(ii) treating the labeled single base extended 
primer so as to release it from the surface. 

In one embodiment, the method further comprises after 

step (i) the step of treating the surface to remove 

primers that have not been extended by a labeled 
dideoxynucleotide . 
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Figure 1: Scheme of single base extension for multiplex 
SNP analysis using biotin-ddNTPs and MALDI-TOF MS. 
Primers that anneal immediately next to the polymorphic 
sites in the DNA template are extended by DNA polymerase 
of a biotin-ddNTP in a sequence-specific manner. After 
solid phase capture and isolation of the 3' -biotinylated 
DNA extension fragments, MALDI-TOF MS was used to analyze 
these DNA products to yield a mass spectrum. From the 
relative mass of each extended primer, compared to the 
mass of an internal standard, the nucleotide at the 
polymorphic site is identified. 

Figure 2. Multiplex SNP genotyping mass spectra generated 
using biotin-ddNTPs. inset is a magnified view of 
heterozygote peaks. Masses of the extension product in 
reference to the internal mass standard were listed on 
each single base extension peak. The mass values in 
parenthesis indicate the mass difference between the 
extension products and the corresponding primers. (A) 
Detection of six nucleotide variations from synthetic DNA 
templates mimicking mutations in the p53 gene. Four 
homozygous (T, G, C and C) and one heterozygous (C/A) 
genotypes were detected. (B) Detection of two 

heterozygotes (A/G and C/G) in the human HFE gene. 

Figure 3: Structure of four mass tagged biotinylated 
ddNTPs. Any of the four ddNTPs {ddATP, ddCTP, ddGTP, 
ddTTP) can be used with any of the illustrated linkers. 

Figure 4: Synthesis scheme for mass tag linkers. For 
illustrative purposes, the linkers are labeled to 
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correspond to the specific ddNTP with which they are 
shown coupled in Figures 3, 5, 7, 8 and 9 . However, any 
of the three linkers can be used with any ddNTP. 
(i) (CF 3 CO) 2 0; (ii, Disuccinimidylcarbonate/ 

diisopropylethylamine; (iii, Propargyl amine. 

Figure 5: The synthesis of ddATP-Linker-II-1 1-Biotin . 

(i) Linker II, tetrakis (triphenylphosphine) palladium ( 0) ; 

(ii) POCl 3 , Bn«N* pyrophosphate; (iii) NH«OH; (i V ) Sulfo- 
NHS-LC-Biotin. 



n 



Figure 6: DNA products are purified by a streptavidi 
coated porous silica surface. Only the biotinylated 
fragments are captured. These fragments are then cleaved 
by light irradiation (hv) to release the captured 
fragments, leaving the biotin moiety still bound to the 
streptavidin. 



Figure 7: Mechanism for the cleavage . of photocleavable 
linkers . 



Figure 8: The structures of ddNTPs linked to 
photocleavable (PC) biotin. Any of the four ddNTPs 
(ddATP, ddCTP, ddGTP, ddTTP) can be used with any of the 
shown linkers. 

Figure 9: The synthesis of ddATP-Linker-II-PC-Biotin . PC 
= photocleavable. 

Figure 10: Schematic for capturing a DNA fragment 
terminated with a dideoxynucleoside monophosphate on a 
surface. The dideoxynucleoside monophosphate (ddNMP) 
which is on the 3' end of the DNA fragment is attached 
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via a linker to a chemical moiety "X" which interacts 
with a compound "Y" on the surface to capture the DNA 
fragment terminated with the ddNMP. The DNA fragment can 
be freed from the surface either by disrupting the 
interaction between chemical moiety X and compound Y 
(lower scheme) or by cleaving the linker (upper scheme) . 

Figures 11A-11C: Schematic of a high throughput channel 
based purification system. Sample solutions can be 
pushed back and forth between the two plates through 
glass capillaries and the streptavidin coated channels in 
the chip. The whole chip can be irradiated to cleave the 
samples after immobilization. 

Figure 12: The synthesis of streptavidin coated porous 
surface . 



Figures 13A-13C: Simultaneous detection of nucleotide 
variations in 30 codons of the p53 tumor suppressor gene 
by MALDI-TOF MS using solid phase capturable biotinylated 
dideoxynucleotide. Each peak represents a different 
polymorphism labeled with its nucleotide identity and 
absolute mass value. The value in parentheses, denoting 
the mass difference between each DNA extension product 
and its corresponding primer, is used to determine the 
nucleotide identity. (A) A mass spectrum from a Wilms' 
tumor sample showing 30 wild type p53 sequences. (B) A 
mass spectrum from a head and neck tumor (primary tumor 
biopsy) containing a heterozygous genotype G/T (4684/4734 
Da) (boxed) in codon 157, corresponding to the wild type 
and mutant alleles, respectively. (C) A mass spectrum 
from a colorectal tumor cell line (HT-29) containing a 
homozygous G to A mutation (boxed) in codon 273 of the 
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p53 gene. The colorectal tumor cell line (SW-480) 
contained the identical G to A mutation in codon 273. 

Figures 14A-14B: (A) A mass spectrum from a head and neck 
tumor sample showing 30 wild type sequences of the p53 
gene. (B) A mass spectrum from a head and neck tumor cell 
line (SCC-4) containing a homozygous C (5881 Da) to T 
(5970 Da) mutation (boxed) in codon 151 of the p53 gene. 
Both spectra were produced using the primers shown in 
Table 3 with primer 16 replaced by primer 5'- 
TGTGGGTTGATTCCACA-3 * for detecting the variation in codon 
151 (C/TCC) . 
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The following definitions are presented as an aid in 
understanding this invention. 

The standard abbreviations for nucleotide bases are used 
as follows: adenine (A), cytosine (C) , guanine (G) , 
thymine (T) , and uracil (U) . 

A nucleotide analogue refers to a chemical compound that 
is structurally and functionally similar to the 
nucleotide, i.e. the nucleotide analogue can be 
recognized by polymerase as a substrate. That is, for 
example, a nucleotide analogue comprising adenine or an 
analogue of adenine should form hydrogen bonds with, 
thymine, a nucleotide analogue comprising C or an 
analogue of C should form hydrogen bonds with G, a 
nucleotide analogue comprising G or an analogue of G 
should form hydrogen bonds with C, and a nucleotide 
analogue comprising T or an analogue of T should form 
hydrogen bonds with A, in a double helix format. 

This invention is directed to a method for determining 
the identity of a nucleotide present at a predetermined 
site in a DNA whose sequence immediately 3' of such 
predetermined site is known which comprises: 

(a) treating the DNA with an oligonucleotide primer 
whose sequence is complementary to such known 
sequence so that the oligonucleotide primer 
hybridizes to the DNA and forms a complex in 
which the 3' end of the oligonucleotide primer 
is located immediately adjacent to the 
predetermined site in the DNA; 
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(b) simultaneously contacting the complex from step 
(a) with four different labeled 

dideoxynucleotides, in the presence of a 
polymerase under conditions permitting a 
labeled dideoxynucleotide to be added to the 3' 
end of the primer so as to generate a labeled 
single base extended primer, wherein each of 
the four different labeled dideoxynucleotides 
(i) is complementary to one of the four 
nucleotides present in the DNA and (ii) has a 
molecular weight which can be distinguished 
from the molecular weight of the other three 
labeled dideoxynucleotides using mass 

spectrometry; and 
(c) determining the difference in molecular weight 
between the labeled single base extended primer 
and the oligonucleotide primer so as to 
identify the dideoxynucleotide incorporated 
into the single base extended primer and 
thereby determine the identity of the 
nucleotide present at the predetermined site in 
the DNA. 

In one embodiment, each of the four labeled 
dideoxynucleotides comprises a chemical moiety attached 
to the dideoxynucleotide by a different linker which has 
a molecular weight different from that of each other 
linker . 

In one embodiment, the method further comprises after 
step (b) the steps of: 

(i) contacting the labeled single base extended 
primer with a surface coated with a compound 
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that specifically interacts with a chemical 
moiety attached to the dideoxynucleotide by a 
linker so as to thereby capture the extended 
primer on the surface; and 

(ii) treating the labeled single base extended 
primer so as to release it from the surface. 

In a further embodiment, the method comprises after step 
(i) the step of treating the surface to remove primers 
that have not been extended by a labeled 
dideoxynucleotide and any non-captured component. 

In one embodiment of the method step (c) comprises 
determining the difference in mass between the labeled 
single base extended primer and an internal mass 
calibration standard added to the extended primer. m 
one embodiment, the internal mass standard is 5'- 
TTTTTCTTTTTCT-3' (SEQ ID NO: 5) (MW = 3855 Da) . 

In one embodiment, the chemical moiety is attached via a 
different linker to different dideoxynucleotides . In one 
embodiment, the different linkers increase mass 
separation between different labeled single base extended 
primers. and thereby increase mass spectrometry 
resolution . 



In one embodiment, the dideoxynucleotide is selected 
the group consisting of 2' , 3' -dideoxyadenosine 
triphosphate (ddATP) , 2 ' , 3' -dideoxyguanosine 

triphosphate (ddGTP) , 2 ' , 3' -dideoxycytidine 

triphosphate (ddCTP) , and 2' , 3' -dideoxythymidine 
triphosphate (ddTTP) . 
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In different embodiments of the methods described herein, 
the interaction between the chemical moiety attached to 
the dideoxynucleotide by the linker and the compound on 
the surface comprises a biotin-streptavidin interaction, 
a phenylboronic acid-salicylhydroxamic acid interaction, 
or an antigen-antibody interaction. 

in one. embodiment, the step of releasing the labeled 
single base extended primer from the surface comprises 
disrupting the interaction between the chemical moiety 
attached by the linker to the dideoxynucleotide and the 
compound on the surface. In different embodiments, the 
interaction is disrupted by a means selected from the 
group consisting of one or more of a physical means, a 
chemical means, a physical chemical means, heat, and 
light. In one embodiment, the interaction is disrupted 
by light. In one embodiment, the interaction is 

disrupted by ultraviolet light. m different 

embodiments, the interaction is disrupted by ammonium 
hydroxide, formamide, or a change in. pH (-log H* 
concentration) . 



In different embodiments, the linker can comprise a chain 
structure, or a structure comprising one or more rings, 
or a structure comprising a chain and one or more rings. 
In different embodiments, the dideoxynucleotide comprises 
a cytosine or a thymine with a 5-position, or an adenine 
or a guanine with a 7-position, and the linker is 
attached to the dideoxynucleotide at the 5-position of 
cytosine or thymine or at the 7-position of adenine or 
guanine . 
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In different embodiments, the step of releasing the 
labeled single base extended primer from the surface 
comprises cleaving the linker between the chemical moiety 
and the dideoxynucleotide . in different embodiments, the 
linker is cleaved by a means selected from the group 
consisting of one or more of a physical means, a chemical 
means, a physical chemical means, heat, and light. In 
one embodiment, the linker is cleaved by light. In one 
embodiment, the linker is cleaved by ultraviolet light. 
In different embodiments, the linker is cleaved by 
ammonium hydroxide, formamide, or a change in pH (-log H* 
concentration) ; 

In one embodiment, the linker comprises a derivative of 
4-aminomethyl benzoic acid. In one embodiment, the 
linker comprises a 2-nitrobenzyl group or a derivative of 
a 2-nitrobenzyl group. In one embodiment, the linker 
comprises one or more fluorine atoms. 
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In one embodiment, the linker is selected from the group 
consisting of: 




CH 2 NHQ(0)CF 3 




CH 2 NHC(0)CF 3 



and 




CH 2 NHC(0)CF 3 



10 



In one embodiment, a plurality of different linkers is 
used to increase mass separation between different 
labeled single base extended primers and thereby increase 
mass spectrometry resolution. 



15 



In one embodiment, the chemical moiety comprises biotin, 
the labeled dideoxynucleotide is a biotinylated 
dideoxynucleotide, the labeled single base extended 
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priraer is a biotinylated single base extended primer, and 
the surface is a streptavidin-coated solid surface. in 
one embodiment, the biotinylated dideoxynucleotide is 
selected from the group consisting of ddATP-ll-biotin, 
ddCTP-ll-biotin, ddGTP-ll-biotin, and ddTTP-16-biotin . 
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In one embodiment, the biotinylated dideoxynucleotide is 
selected from the group consisting of: 



ddNTPI' 



1 



S-' H 



ddNTP2 



ddNTP3 



o 

O F 



O F 





wherein ddNTPI , ddNTP2, ddNTP3, and ddNTP4 represent 
four different dideoxynucleotides , or their 
analogues . 
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In one embodiment, the biot inylated dideoxynucleot ide is 
selected from the group consisting of: 
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In one embodiment, the biotinylated dideoxynucleotide is 
selected from the group consisting of: 



o Y o 



0 2 N 



HN^Nh 



ddTTP' 



o 



ddATP 




O 

N^O. 



F O 

ddGTP' -™V\f 'jf^ 0 - 

O F 




HN NH 

n 

O 



H 



HN^NH 
O 



HN^NH 
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In one embodiment, the streptavidin-coated solid surface 

is a streptavidin-coated magnetic bead or a streptavidin- 
coated silica glass. 



In one embodiment of the method, steps (a) and (b) are 
performed in a single container or in a plurality of 
connected containers. 

The invention provides methods for determining the 
identity of nucleotides present at a plurality of 
predetermined sites, which comprises carrying out any of 
the methods disclosed herein using a plurality of 
different primers each having a molecular weight 
different from that of each other primer, wherein a 
different primer hybridizes adjacent to a different 
predetermined site. In one embodiment, different linkers 
each having a molecular weight different from that of 
each other linker are attached to the different 
dideoxynucleotides to increase mass separation between 
different labeled single base extended primers and 
thereby increase mass spectrometry resolution. 



In one embodiment, the mass spectrometry 
assisted laser desorption/ionization time-of 
spectrometry. 

Linkers are provided for attaching a chemical moiety to a 
dideoxynucleotide, wherein the linker comprises a 
derivative of 4 -aminomethyl benzoic acid. 



is matrix- 
flight mass 



In one embodiment, the dideoxynucleotide is selected from 
the group consisting of 2 ' , 3' -dideoxyadencsine 5'- 



WO 2004/007773 PCT/US2003/021818 

-22- 

triphosphate (ddATP) , 2' , 3' -dideoxyguanosine 5'- 

triphosphate (ddGTP) , 2 ' , 3' -dideoxycytidine 5'- 

triphosphate (ddCTP) , and 2' , 3' -dideoxy thymidine 5' - 
triphosphate (ddTTP) . 

In one embodiment, the linker comprises one or more 
fluorine atoms. 

In one embodiment, the linker is selected from the group 
consisting of: 




CH 2 NHC(0)CF 3 
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In different embodiments, the linker can comprise a chain 
structure, or a structure comprising one or more rings, 
or a structure comprising a chain and one or more rings. 

In different embodiments, the linker is cleavable by a 
means selected from the group consisting of one or more 
of a physical means, a chemical means, a physical 
chemical means, heat, and light. In one embodiment, the 
linker is cleavable by ultraviolet light. In different 
embodiments, the linker is cleavable by ammonium 
hydroxide, formamide, or a change in pH (-log H* 
concentration) . 

In different embodiments of the linker, the chemical 
moiety comprises biotin, streptavidin or related 
analogues that have affinity with biotin, phenylboronic 
acid, salicylhydroxamic acid, an antibody, or an antigen. 

In different embodiments, the dideoxynucleotide comprises 
a cytosine or a thymine with a 5-position, or an adenine 
or a guanine with a 7-position, and the linker is 
attached to the 5-position of cytosine or thymine or to 
the 7-position of adenine or guanine. 

- The invention provides for the use of any of the linkers 
described herein in single nucleotide polymorphism 
detection using mass spectrometry, wherein the linker 
increases mass separation between different 

dideoxynucleotides and increases mass spectrometry 
resolution . 

Labeled dideoxynucleotides are provided which comprise a 
chemical moiety attached via a linker to a 5-position of 
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cytosine or thymine or to a 7-position of adenine or 
guanine . 

In one embodiment, the dideoxynucleotide is selected from 
the group consisting of 2' , 3' -dideoxyadenosine 5'- 
triphosphate (ddATP) , 2' ,3' -dideoxyguanosine 5' - 

triphosphate (ddGTP) , 2' , 3' -dideoxycytidine 5'- 

triphosphate (ddCTP) , and * 2 ' , 3' -dideoxythymidine 5'- 
triphosphate (ddTTP) . 

In different embodiments, the linker can comprise a chain 
structure, or a structure comprising one or more rings, 
or a structure comprising a chain and one or more rings. 
In different . embodiments, the linker is cleavable by a 
means selected from the group consisting of one or more 
of a physical means, a chemical means, a physical 
chemical means, heat, and light. In one embodiment, the 
linker is cleavable by ultraviolet light. In different 
embodiments, the linker is cleavable by ammonium 
hydroxide, formamide, or a change in pH -log (H* 
concentration] . 



In different embodiments of the labeled 

dideoxynucleotide, the chemical moiety comprises biotin, 
streptavidin, phenylboronic acid, salicylhydroxamic acid, 
an antibody, or an antigen. 
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In one embodiment, the labeled dideoxynucleotide i< 
selected from the group consisting of: 



ddNTPI 




S— ' H 



ddNTP2 



ddNTP3 



O 



ddNTP 




and 




wherein ddNTPl, ddNTP2, ddNTP3, and ddNTP4 represent 
four different dideoxynucleotides , or their 
analogues . 
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In one embodiment, the labeled dideoxynucleotide is 
selected from the group consisting of: 
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In one embodiment, the labeled dideoxynucleotide is 
selected from the group consisting of: 

M 



ddTTP ' 



ddATP 



H 



ddGTP^ 

6 x f 

0 2 N 



Y 



HN NH 

If 
O 



HN^NH 



-Q^a Y~~~13 
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In one embodiment, the labeled dideoxynucleotide has a 
molecular weight of 844, 977, 1, 017, or 1,051. m one 
embodiment, the labeled dideoxynucleotide has a molecular 
weight of 1,049, 1,182, 1,222, or 1,257. other molecular 
weights with sufficient mass differences to allow 
resolution in mass spectrometry can also be used. 

In one embodiment the mass spectrometry is matrix- 
assisted laser desorption/ionization time-of-f light mass 
spectrometry . 

A system is provided for separating a chemical moiety 
from other components in a sample in solution, which 
comprises : 

(a) a channel coated with a compound that 
specifically interacts with the chemical moiety 
at the 3' end of the DNA fragment, wherein the 
channel comprises a plurality of ends; 

(b) a plurality of wells each suitable for holding 
the sample ; 

(c) a connection between each end of the channel 
and a well; and 

(d) a means for moving the sample through the 
channel between wells. 

In one embodiment of the system, the interaction between 
the chemical moiety and the compound coating the surface 
is a biotin-streptavidin interaction, a phenylboronic 
acid-salicylhydroxamic acid interaction, or an antigen- 
antibody interaction. 

In one embodiment, the chemical moiety is a biotinylated 
moiety and the channel is a streptavidin-coated silica 
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glass channel. In one embodiment, the biotinylated moiety 
is a biotinylated DNA fragment. 

In one embodiment, the chemical moiety can be freed from 
the surface by disrupting the interaction between the 
chemical moiety and the compound coating the surface. In 
different embodiments, the interaction can be disrupted 
by a means selected from the group consisting of one or 
more of a physical means, a chemical means, a physical 
chemical means, heat, and light. In different 
embodiments, the interaction can be disrupted by ammonium 
hydroxide, formamide, or a change in pH -log [H* 
concentration] . 

In one embodiment, the chemical moiety is attached via a 
linker to another chemical compound. In one embodiment, 
the other chemical compound is a DNA fragment. In one 
embodiment, the linker is cleavable by a means selected 
from the group consisting of one or more of a physical 
means, a chemical means, a physical chemical means, heat, 
and light. In one embodiment, the channel is transparent 
to ultraviolet light and the linker is cleavable by 
ultraviolet light. Cleaving the linker frees the DNA 
fragment or other chemical compound from the chemical 
moiety which remains captured on the surface. 

Multi-channel systems are provided which comprise a 
plurality of any of the single channel systems disclosed 
herein. In one embodiment, the channels are in a chip. 
In one embodiment, the multi-channel system comprises 96 
channels in a chip. Chips can also be used with fewer or 
greater than 96 channels. 
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The invention provides for the use of any of the 
separation systems described herein for single nucleotide 
polymorphism detection. 

This invention will be better understood from the 
Experimental Details which follow. However, one skilled 
in the art will readily appreciate that the specific 
methods and results discussed are merely illustrative of 
the invention as described more fully in the claims which 
follow thereafter. 
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Experimental Set I 
A. Materials and Methods 

PGR amplification. DNA templates containing the 

polymorphic sites for the human hereditary 
hemochromatosis gene HFE were amplified from genomic DNA 
in a total volume of 10 jil, that contains 20 ng of 
genomic DNA, 500 pmol each of forward (C282Y; 5'- 
CTACCCCCAGAACATCACC-3' (SEQ ID NO: 1), H63D; 5' - 
GCACTACCTCTTCATGGGTGCC-3 ' (SEQ ID NO: 2)) and reverse 
(C282Y; 5 ' -CATCAGTCACATACCCCA-3 1 (SEQ ID NO: 3), H63D; 
5' -CAGTGAACATGTGATCCCACCC-3 9 (SEQ ID NO: 4)) primers, 
25 ^iM dNTPs (Amersham Biosciences, Piscataway, NJ) , 1 u 
Taq polymerase (Life Technologies, Rockville, MD) , and lx 
PCR buffer (50 mM KCl, 1.5 mM MgCl 2 , 10 mM Tris-HCl). PCR 
amplification reactions were started at 94 °C for 4 min, 
followed by 45 cycles of 94 °C for 30 s, 59 °C for 30 s 
and 72 °C for 10 s, and finished with an additional 
extension step of 72 °C for 6 min. Excess primers and 
dNTPs were degraded by adding 2 U shrimp alkaline 
phosphatase (Roche Diagnostics, Indianapolis, IN) and E. 
Coli exonuclease I (Boehringer Mannheim, Indianapolis, 
IN) in lx shrimp alkaline phosphatase buffer. The 
reaction mixture was incubated at 37 °C for 45 min 
followed by enzyme inactivation at 95 Q C for 15 min. 

Single base extension using biotin-ddNTPs . The synthetic 
DNA templates containing six nucleotide variations in p53 
gene and the five primers for detecting these variations 
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are shown in Table 1. These oligonucleotides and an 
internal mass standard (5' -TTTTTCTTTTTCT-3' (SEQ ID NO: 
5), MW = 3855 Da) for MALDI-TOF MS measurement were made 
using an Expedite nucleic acid synthesizer (Applied 
Biosystems, Foster City, CA) . SBE reactions contained 20 
pmol of printer, 10 pmol of biotin-ll-ddATP, 20 pmol of 
biotin-ll-ddGTP, 40 pmol of biotin-ll-ddCTP (New England 
Nuclear Life Science, Boston," MA), 80 pmol of biotin-16- 
ddUTP (Enzo Diagnostics, Inc., Farmingdale, NY), 2 pi 
Thermo Sequenase reaction buffer, 1 U Thermo Sequenase in 
its diluted buffer (Amersham Biosciences) and 20 pmol of 
either synthetic template or 10 ^il PGR product in a total 
reaction volume of 20 nl . For SBE using synthetic 
template 1, 10 pmol of both wild type and mutated 
templates were combined with 20 pmol of primers 1 and 3 
or 20 pmol of primers 2 and 4. The SBE reaction of 
primer 5 was performed with template 2 in a separate 
tube. PCR products from the HFE gene were mixed with 
20 pmol of the corresponding primers 5'- 

GGGGAAGAGCAGAGATATACGT-3 1 (SEQ ID NO: 6) (C282Y) and 5'- 
GGGGCTCCACACGGCGACTCTC-AT-3 1 (SEQ ID NO: 7) (H63D) in SBE 
to detect the two heterozygous genotypes. All extension 
reactions were thermalcycled for 35 cycles at 94 °C for 
10 s and 49 °C for 30 s. 

Solid phase purification. 20 j^l of the s treptavidin- 
coated magnetic beads (Seradyn, Ramsey, MN) were washed 
with modified binding and washing (B/W) buffer (0.5 mM 
Tris-HCl buffer., 2 M NH 4 C1, 1 mM EDTA, pH 7.0) and 
resuspended in 20 jil modified B/W buffer. Extension 
reaction mixtures of primers 1-4 with template 1 and 
primer 5 with template 2 were mixed in a 2:1 ratio, while 
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extension reaction mixtures from the PCR products of HFE 
gene were mixed in equal amounts. 20 jil of each mixed 
extension product was added to the suspended beads and 
incubated for 1 hour. After capture, the beads were 
washed twice with modified B/W buffer, twice with 0.2 M 
triethyl ammonium acetate (TEAA) buffer and twice with 
deionized water. The primer extension products were 
released from the magnetic beads by treatment with 8 pi 
98 «% formamide solution containing 2 % 0.2 M TEAA buffer 
at 94 °C for 5 min. The released primer extension 
products were precipitated with 100 % ethanol at 4 °C for 
30 min, and centrifuged at 4 °C and 14000 RPM for 35 min. 

MALDI-TOF MS analysis. The purified primer extension 
products were dried and re-suspended in 1 jil deionized 
water and 2 \xl matrix solution. The matrix solution was 
made by dissolving 35 mg of 3-hydroxypicolinic acid (3- 
HPA; Aldrich, Milwaukee, WI) and 6 mg of ammonium citrate 
(Aidrich) in 0.8 ml of 50 % acetonitrile . 10 pmol 

internal mass standard in 1 ^1 of 50 % acetonitrile was 
then added to the sample. 0 . 5 \xl of this mixture 
containing the primer extension products and internal 
standard was spotted on a stainless steel sample plate, 
air-dried and analyzed using an Applied Biosystems 
Voyager DE Pro MALDI-TOF mass spectrometer. All 
measurements were taken in linear positive ion mode with 
a 25 kV accelerating voltage, a 94 % grid voltage and a 
350 ns delay time. The obtained spectra were processed 
using the Voyager data analysis package. 
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B. Detection of Single Nucleotide Polymorphism Using 
Biotinylated Dideoxynucleotides and Mass Spectrometry 

Solid phase capturable biotinylated dideoxynucleotides 
(biotin-ddNTPs) were used in single base extension for 
multiplex genotyping by mass spectrometry (MS) . In this 
method, oligonucleotide primers that have different 
molecular weights and that are specific to the 
polymorphic sites in the DNA template are extended with 
biotin-ddNTPs by DNA polymerase to generate 3»- 
biotinylated DNA extension products (Figure 1) . These 
products are then captured by streptavidin-coated solid 
phase magnetic beads, while the unextended primers and 
other components in the reaction are washed away. The 
pure extension DNA products are subsequently released 
from the solid phase and analyzed with matrix-assisted 
laser desorption/ionization time-of -flight MS. The mass 
of the extension DNA products is determined using a 
stable oligonucleotide as a common internal mass 
standard. Since only the pure extension DNA products are 
introduced to MS for analysis, the resulting mass 
spectrum is free of non-extended primer peaks and their 
associated dimers, which increases the accuracy and scope 
of multiplexing in single nucleotide polymorphism (SNP) 
analysis. The solid phase purification approach also 
facilitates desalting of the captured oligonucleotides, 
which is essential for accurate mass measurement by MS. 

Four biotin-ddNTPs with distinct molecular weights were 
selected to generate extension products that have a two- 
fold increase in mass difference compared to that with 
conventional ddNTPs . This increase in mass difference 
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provides improved resolution and accuracy in detecting 
heterozygotes in the mass spectrum. 

The "lock and key" functionality of biotin and 
streptavidin is often utilized in biological sample 
preparation as a way to remove undesired impurities (23) . 
In different embodiments of the methods described herein, 
affinity systems other than biotin-streptavidin can be 
used. Such affinity systems include but are not limited 
to phenylboronic acid-salicylhydroxamic acid (31) and 
antigen-antibody systems. 



.The multiplex genotyping approach was validated by 
detecting six nucleotide variations from synthetic DNA 
templates that mimic mutations in exons 7 and 8 of the 
p53 gene. Sequences of the templates and the 

corresponding primers are shown in Table 1 along with the 
masses of the primers and their extension products. The 
mass increase of the resulting single base extension 
products in comparison with the primers is 665 Da for 
addition of biot in-ddCTP, 688 Da for addition of biotin- 
ddATP, 704 Da for addition of biotin-ddGTP and 754 Da for 
addition of biotin-ddUTP . The mass data in Table 1 
indicate that the smallest mass difference among any 
possible extensions of a primer is 16 Da (between biotin- 
ddATP and biotin-ddGTP) . This is a substantial increase 
over the smallest mass difference between extension 
products using standard ddNTPs (9 Da . between ddATP and 
ddTTP) . This mass increase yields improved resolution of 
the peaks in the mass spectrum. Increased mass 

difference in ddNTPs fosters accurate detection of 
heterozygous genotypes (15), since an A/T heterozygote 
with a mass difference of 9 Da using conventional ddNTPs 



WO 2004/007773 PCT/US2003/021818 

-37- 

can not be well resolved in the MALDI-TOF mass spectra. 
The five primers for each polymorphic site were designed 
to produce extension products without overlapping masses. 
Primers extended by biotin-ddNTPs were purified and 
analyzed by MALDI-TOF MS according to the scheme in 
Figure 1. Extension products of all five primers were 
well-resolved in the mass spectrum free from any 
unextended primers (Figure 2A) , allowing each nucleotide 
variation to be unambiguously identified. Unextended 
primers occupy the mass range in the mass spectrum 
decreasing the scope of multiplexing, and excess primers 
can dimerize to form false peaks in the mass spectrum 
(21) . The excess primers and their associated dimers 
also compete for the ion current, reducing the detection 
sensitivity of MS for the desired DNA fragments. These 
complications were completely removed by carrying out SBE 
using biotin-ddNTPs and solid phase capture. Extension 
products for all four biotin-ddNTPs were clearly detected 
with well resolved mass values. The relative masses of 
the primer extension products in comparison to the 
internal mass standard revealed the identity of each 
nucleotide at the polymorphic site. In the case of 
heterozygous genotypes, two peaks, one corresponding to 
each allele (C/A) , are clearly distinguishable in the 
mass spectrum shown in Figure 2A. 
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Table 1. Oligonucleotide primers and synthetic DNA 
templates for detecting mutations in the p53 gene. 
(Top) The sequences and the calculated masses of 
primers and the four possible single base extension 
products relative to the internal mass standard are 
listed. The bold numbers refer to the nucleotide 
variations detected in the p53 gene. (Bottom) The six 
nucleotide variations in template 1 and 2 are shown 
in bold letters. Template 1 contains a heterozygous 
genotype (G/T) . Primers 1-5 = SEQ ID NOs: 8-12, 
respectively. 



Primers 


Primer sequences 


Masses 


Masses of single base extension 




iOa) 




products (Da) 










Biotin- 


Biotin- 


Biot in- 


Biotin- 








ddCTP 


ddATP 


ddGTP 


ddUTP 








A665 


A6S8 


A704 


J754 


1 


5 ' -AGAGGATCCAACCGAGAC- 3 ' 


1656 


2321 


2344 


2360 


2410 


2 


5' - 


3350 


4015 


4038 


4054 


4103 




TGGTGGTAGG7GATGTTGATGTA- 










3 


3' 


2833 


3498 


3521 


3538 


3587 


4 


5' - 


2134 


2799 


2822 | 


2838 


2480 




CACATTGTCAAGGACGTACCCG-3 ' 












5 




2507 


3172 


3195 


3211 


3261 




5' -TACCCGCCGTACTTGGCCTC- 
3' 














5' -TCCACGCACAAACACGGACAG- 














3' 













Templates 



Template sequences 



5'- 



TACCCG/TGAGGCCAAGTACGGCGGGTACGTCCTTGACAATGTGTACATCAACATCACCTACCACCATGT 
CAGTCTCGGTTGGA7CCTCTATTGTGTCCGGG- 3 ' (SEQ ID NO: 13) 



5 f - 



GAAGGAGACACGCGGCCAGAGAGGGTCCTGTCCGTGTTTGTGCGTGGAGTTTCGACAAGGCAGGGTCAT 
CTAATGGTGATGAGTCCTATCCTTTTCTCTTCGTTCTCCGT- 3 1 (SEQ ID NO: 14) 
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One advantage of MALDI-TOF MS in comparison to other 
detection techniques is its ability to simultaneously 
measure masses of DNA fragments over a certain range. 



In order to explore this feature to detect multiple SNPs 
in a single spectrum, if unextended primers are not 
removed, masses of all primers and their extension 
products must have sufficient differences to' yield 
adequately resolved peaks in the mass spectrum. Rcss et 
al. simultaneously detected multiple SNPs by carefully 
tuning the masses of all primers and extension products 
so that they would lie in the range of 4.5 kDa and 
7.6 kDa without overlapping (14). Since the unextended 
primers occupy the mass range in the mass spectrum, by 
eliminating them, the approach disclosed herein will 
significantly increase the scope of ' multiplexing in SNP 
analysis . 

To demonstrate the ability of this method to discriminate 
SNPs in genomic DNA, two disease associated SNPs were 
genotyped in the human hereditary hemochromatosis (HHC) 
gene HFE. HHC is a common genetic condition in 

Caucasians with approximately 1/400 Caucasians homozygous 
for the C282Y mutation leading to iron overload and 
potentially liver failure, diabetes and depression (22) . 
A subset of individuals who are compound heterozygotes 
for the C282Y and H63D mutations also manifest iron 
overload. Because of the high prevalence of these 
mutations and the ability to prevent disease 
manifestations by phlebotomy, accurate methods for 
genotyping these two SNPs will foster genetic screening 
for this condition. Two PCR products were generated from 
human genomic DNA for the C282Y and H63D polymorphic 
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sites of the HFE gene and then used these products for 
SBE with biotin-ddNTPs. After the extension reaction, 
products were purified using solid phase capture 
according to the scheme in Figure 1 and analyzed by 
MALDI-TOF MS. The mass spectrum obtained from this 
experiment is shown in Figure 2B. Extension products of 
each primer were readily identified by their mass 
relative to the internal mass standard. Heterozygous 
genotypes of A/G and C/G with a mass difference of 16 Da 
and 39 Da respectively were accurately detected at the 
C282Y and' H63D polymorphic sites. 

These results indicate that the use of solid phase 
capturable biotin-ddNTPs in SBE, coupled with MALDI-TOF 
MS detection, provides a rapid and accurate method for 
multiplex SNP detection over broad mass ranges and should 
greatly increase the number of SNPs that can be detected 
simultaneously. In multiplex SBE reactions, the 

oligonucleotide primers and their dideoxynucleot ide 
extension products differ by only one base pair, which 
requires analytical techniques with high resolution to 
resolve. In addition, a primer designed to detect one 
polymorphism and an extension product from another 
polymorphic site may have the same size, which can not be 
separated by electrophoresis and other conventional 
chromatographic or size exclusion methods. Methods for 
purifying DNA samples using the strong interaction of 
biotin and streptavidin are widely used (23-27). By 
introducing the biotin moiety at the 3' end of DNA, the 
solid phase based affinity purification approach 
described here is a unique and effective method to remove 
the oligonucleotide primers from the dideoxynucleotide 
extension products . 
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To increase the stability of DNA fragments for MALDI-TOF 
MS measurement in multiplex SNP analysis, nucleotide 
analogues (28) and peptide nucleic acid (9) can be used 
in the construction of the oligonucleotide primers. It 
has been shown that MALDI-TOF MS could detect DNA 
fragments up to 100 bp with sufficient resolution (29).. 
The mass difference between each adjacent DNA fragment is 
approximately 300 Da. Thus, with a mass difference of 
100 Da for each primer in designing a multiplex SNP 
analysis project, at least 300 SNPs can be analyzed in a 
single spot of the sample plate by MS . It is a routine 
method now to place 384 spots in each sample plate in MS 
analysis. Thus, each plate can produce over 100,000 
SNPs, which is roughly the entire SNPs in all the coding 
regions of the human genome. This level of multiplexing 
should be achievable by mass tagging the primers with 
stable chemical groups in SBE using biotin-ddNTPs . For 
SNP sites of interest, a master database of primers and 
the resulting masses of all four possible . extension 
products can be constructed. The experimental data from 
MALDI-TOF MS can then be compared with this database to 
precisely identify the library of SNPs automatically. 
This method coupled with future improvements in mass 
spectrometer detector sensitivity (30) will provide a 
platform for high-throughput SNP identification unrivaled 
in speed and accuracy. 

C. Design and Synthesis of Biotinylated 

dideoxynucleotides with Mass Tags 

The ability to distinguish various bases in DNA using 
mass spectrometry is dependent on the mass differences of 
the bases in the spectra. For the above work, the 
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smallest difference in mass between any two nucleotides 
is 16 daltons (see Table 1). Fei et al. (15) have shown 
that using dye-labeled ddNTP paired with a regular dNTP 
to space out the mass difference, an increase in the 
detection resolution in a single nucleotide extension 
assay can be achieved. To enhance the ability to 
distinguish peaks in the spectra, the current application 
discloses systematic modification of the biotinylated 
dideoxynucleotides by incorporating mass linkers 
assembled using 4-aminomethyl benzoic acid derivatives to 
increase the mass separation of the individual bases. The 
mass linkers can be modified by incorporating, one or two 
fluorine atoms to further space out the mass differences 
between the nucleotides. The structures of four 

biotinylated ddNTPs are shown in Figure 3. ddCTP-11- 
biotin is commercially available (New England Nuclear, 
Boston) . ddTTP-Linker I-ll-Biotin, ddATP-Linker 11-11- 
Biotin and ddGTP-Linker I II-l 1-Biotin are synthesized as 
shown, for example, for ddATP-Linker II-ll-Biotin in 
Figure 5. In designing these mass tag linker modified 
biotinylated ddNTPs, the linkers are attached to the 5- 
position on the pyrimidine bases (C and T) , and to the 7- 
position on the purines (A and G) for subsequent 
conjugation with biotin. It has been established that 
modification of these positions on the bases in the 
nucleotides, even with bulky energy transfer fluorescent 
dyes, still allows efficient incorporation of the 
modified nucleotides into the DNA strand by DNA 
polymerase (32, 33) . Thus, the ddNTPs-Linker-ll-biotin 
can be incorporated into the growing strand by the 
polymerase in DNA sequencing reactions. 
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Larger mass separations will greatly aid in longer read 
lengths where signal intensity is smaller and resolution 
is lower. The smallest mass difference between two 
individual bases is over three times as great in the mass 
5 tagged biotinylated ddNTPs compared to normal ddNTPs and 

more than double that achieved by the standard 

biotinylated ddNTPs as shown in Table 2. 

Table 2. Relative mass differences (daltons) of 
10 dideoxynucleotides using ddCTP as a reference. 



Base 


Standard 
ddNTP 


Commercial 
Biotinylated 
ddNTP 


Biotinylated ddNTP 
with mass tag 
linker 


C relative 
to C 


0 


0 


0 (no linker) 


T relative 
to C 


15 


89 (16 
linker) 


125 (Linker I) 


A relative 
to C 


24 


24 


165 (Linker II) 


G relative 
to C 


40 


40 


200 (Linker III) 


Smallest 
relative 
difference 


9 


16 


35 



Three 4-aminomethyl benzoic acid derivatives Linker I , 
Linker II and Linker III are designed as mass tags as 
well as linkers for bridging biotin to the corresponding 
dideoxynucleotides. The synthesis of Linker II (Figure 
4) is described here to illustrate the synthetic 
procedure. 3-Fluoro-4-aminomethyl benzoic acid that can 
be easily prepared via published procedures (41, 42) is 
first protected with trif luoroacetic anhydride, then 
converted to N-hydroxysuccinimide (NHS) ester with 
disuccinimidylcarbonate in the presence of 

diisopropylethylamine . The resulting NHS ester is 

subsequently coupled with commercially available 



20 
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propargylamine to form the desired compound, Linker II. 
Using an analogous procedure, Linker I and Linker III can 
be easily constructed. 

Figure 5 describes the scheme required to prepare 
biotinylated ddATP-Linker II-ll-Biotin using well- 
established procedures (34-36). 7-i-ddA is coupled with • 
•linker II in the presence of tetrakis ( triphenylphosphine) 
palladium(O) to produce 7-Linker II-ddA, which is 
phosphdrylated with P0C1 3 in butylammonium pyrophosphate 
(37). After removing the tr if luoroacetyl group with 
ammonium hydroxide, 7-Linker II-ddATP is produced, which 
then couples with sulf o-NHS-LC-Biot in (Pierce, Rockford 
ID to yield the desired ddATP-Linker II-ll-Biotin. 
Similarly, ddTTP-Linker I-ll-Biotin, and ddGTP-Linker 
III-ll-Biotin can be synthesized. 

D. Design and Synthesis of Mass Tagged ddNTPs Containing 
Photocleavable Biotin 

A schematic of capture and cleavage of the photocleavable 
linker on the streptavidin coated porous surface is shown 
in Figure 6. At the end of the reaction, the reaction 
mixture consists of excess primers, enzymes, salts, false 
stops, and the desired DNA fragment. This reaction 
mixture is passed over a streptavidin-coated surface and 
allowed to incubate. The biotinylated fragments are 
captured by the streptavidin surface, while everything 
else in the mixture is washed away. Then the fragments 
are released into solution by cleaving the photocleavable 
linker with near ultraviolet (UV) light, while the biotin 
remains attached to the streptavidin that is covalently 
bound to the surface. The pure DNA fragments can then be 
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crystallized in matrix solution and analyzed by mass 
spectrometry. it is advantageous to cleave the biotin 
moiety since it contains sulfur which has several 
relatively abundant isotopes. The rest of the DNA 
fragments and linkers contain only carbon, nitrogen, 
hydrogen, oxygen, fluorine and phosphorous, whose 
dominant isotopes are found with a relative abundance of 
99% to 100%. This allows high resolution mass spectra to 
be obtained. The photocleavage mechanism (38, 39) is 
shown in Figure 7. Upon irradiation with ultraviolet 
light at 300-350 nm, the light sensitive o-ni troaromatic 
carbonamide functionality on DNA fragment 1 is cleaved, 
producing DNA fragment 2, PC-biotin and carbon dioxide. 
The partial chemical linker remaining on DNA fragment 2 
is stable for detection by mass spectrometry. 

Four new biotinylated ddNTPs disclosed here, ddCTP-PC- 
Biotin, ddTTP-Linker I-PC-Biotin, ddATP-Linker II-PC- 
Biotin and ddGTP-Linker III-PC-Biotin are shown in Figure 
8. These compounds are synthesized by a similar 

chemistry as shown for the synthesis of ddATP-Linker II- 
11-Biotin in Figure 6. The only difference is that in 
the final coupling step NHS-PC-LC-Biotin (Pierce, 
Rockford IL) is used, as shown in Figure 9. . The 
photocleavable linkers disclosed here allow the use of 
solid phase capturable terminators and mass spectrometry 
to be turned into a high throughput technique for DNA 
analysis . 
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E. Overview of capturing a DNA fragment terminated with a 
ddNTP on a surface and freeing the ddNTP and DNA fragment 

The DNA fragment is terminated with a dideoxynucleoside 
monophosphate (ddNMP) . The ddNMP is attached via a 
linker to a chemical moiety ("X" in Figure 10) . The DNA 
fragment terminated with ddNMP is captured on the surface 
through interaction between chemical moiety "X" and a 
compound on or attached to the surface ("Y" in Figure 
10) . The present application discloses two methods for 
freeing the captured DNA fragment terminated with ddNMP. 
In the situation illustrated in the lower part of Figure 
,10, the DNA fragment terminated with ddNMP is freed from 
the surface by disrupting or breaking the interaction 
between chemical moiety "X" and compound "Y". In the 
upper part of Figure 10, the DNA fragment terminated with 
ddNMP is attached to chemical moiety "X" via a cleavable 
linker which can be cleaved to free the DNA fragment 
terminated with ddNMP. 



Different moieties and compounds can be used for the "X" 
- "Y" affinity system, which include but are not limited 
to, biotin-streptavidin, phenylboronic acid- 

salicylhydroxamic acid (31) , and antigen-antibody 
systems . 



In different embodiments, the cleavable linker can be 
cleaved and the "X" - W Y" interaction can be disrupted by 
a means selected from the group consisting of one or more 
of a physical means, a chemical means, a physical 
chemical means, heat, and light. In one embodiment, 

ultraviolet light can be used to cleave the cleavable 
linker. Chemical means include, but are not limited to, 
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ammonium hydroxide (40), formamide, or a change in pH (- 
log H + concentration) of the solution. 



F . High density streptavidin-coated, porous silica 
channel sys tern . 



Streptavidin coated magnetic beads are not ideal for 
using the photocleavable biotin capture and release 
process for DNA fragments, since they are not transparent 
to UV light. Therefore, the photocleavage reaction is not 
efficient. For efficient capture of the biotinylated 
fragments, a high-density surface coated with 
streptavidin is essential. It is known that the 

commercially available 96-well streptavidin coated plates 
cannot provide a sufficient surface area for efficient 
capture of "the biotinylated DNA fragments. Disclosed in 
this application is a porous silica channel system 
designed to* overcome this limitation. 

To increase the surface area available for solid phase 
capture, porous channels are coated with a high density 
of streptavidin. For example, ninety-six (96) porous 
silica glass channels can be etched into a silica chip 
(Figure 11) . The surfaces of the channels are modified 
to contain streptavidin as shown in Figure 12. The 
channel is first treated with 0.5 M NaOH, washed with 
water, and then briefly pre-etched with dilute hydrogen 
fluoride. Upon cleaning with water, the capillary channel 
is coated with high density 3-aminopropyltrimethoxysilane 
in aqueous ethanol (43). An excess of disuccinimidyl 
glutarate in N, N-dimethylf ormamide (DMF) is then 
introduced into the capillary to ensure a highly 
efficient conversion of the surface end group to a 
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succinimidyl ester. Streptavidin is then conjugated with 
the succinimidyl ester to form a high-density surface 
using excess streptavidin solution. The resulting 96- 
channel chip is used as a purification cassette. 

A 96-well plate that can be used with biotinylated 
terminators for DNA analysis is shown in Figure 11. in 
the example shown, each end of a channel is connected to 
a single well. However, for other applications, the end 
of a channel could be connected to a plurality of wells. 
Pressure is applied to drive the samples through a glass 
capillary into the channels on the chip. Inside the 
channels the biotin is captured by the covalently bound 
streptavidin. After passing through the channel, the 
sample enters into a clean plate in the other end of the 
chip. Pressure applied in reverse drives the sample 
through the channel multiple times and ensures a highly 
efficient solid phase capture. Water is similarly added 
to drive out the reaction mixture and thoroughly wash the 
captured fragments. After washing, the chip is 

irradiated with ultraviolet light to cleave the 
photosensitive linker and release the DNA fragments. The 
fragment solution is then driven out of the channel and 
into a collection plate. After matrix solution is added, 
the samples are spotted on a chip and allowed to 
crystallize for detection by MALDI-TOF mass spectrometry. 
The purification cassette is cleaned . by chemically 
cleaving the biotin-streptavidin linkage, and is then 
washed and reused. 
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A. Synopsis 

The following experiments show the simultaneous 
genotyping of 30 nucleotide variations in the p53 gene 
from human tumors in one tube, by using solid phase 
capturable dideoxynucleotides to generate single base 
extension products which are detected by mass 
spectrometry. Both homozygous and heterozygous genotypes 
are accurately determined with digital resolution. This 
is the highest level of SNP multiplexing reported thus 
far using mass spectrometry, indicating the approach will 
have wide applications in screening a repertoire of 
genotypes in candidate genes as potential markers for 
cancer and other diseases. 



B. Introduction 



With the completion of the Human Genome Project, a stage 
has been set to screen genetic mutations for identifying 
disease genes in a genomewide scale (44). Matrix-assisted 
laser desorption/ionization time-of -flight mass 

spectrometry (MALDI-TOF MS), which allows rapid DNA 
sample measurement yielding digital data, has been 
explored to detect single nucleotide polymorphisms (SNPs) 
using invasive cleavage (11) and primer-directed base 
extension (14, 45). Conventional single base extension 
(SBE) methods using MS to measure multiplex SNPs require 
unambiguous simultaneous detection of a library of 
primers and their extension products. However, 
limitations in resolution and sensitivity of MALDI-TOF MS 
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for longer DNA molecules make it difficult to 
simultaneously measure DNA fragments over a large mass 
range. The requirement to measure both primers and their 
extension products in this range limits the scope of 
multiplexing. The use of MALDI-TOF MS and molecular 
affinity for multiplex digital SNP detection using solid 
phase capturable (SPC) dideoxynucleotides and SBE has 
recently been explored, establishing the feasibility of 
simultaneously measuring 20 SNPs in synthetic DNA 
templates (46) . This study shows the simultaneous 
genotyping of 30 nucleotide variations, corresponding to 
known sites of cancer-associated somatic mutations, in 
exons 5, 7 and 8 of the p53 gene from human tumors in one 
tube using the SPC-SBE method. This is the highest level 
of multiplexing reported thus far using mass spectrometry 
for SNP analysis. 



C. Materials and Methods 

Multiplex PCR and single base extension reactions 

Multiplex PCR was performed to amplify 3 regions in exons 
5, 7 and 8 of the p53 gene. The primers for each region 
were 5 1 -TATCTGTTCACTTGTGCCC-3 1 (exon 5, forward), 5'- 
CAGAGGCCTGGGGA-CCCTG-3' (exon 5, reverse), 5'- 

CTGCTTGCCACAGGTCTC-3' (exon 7, forward) , 5'-CACAGCAG- 
GCCAGTGTGC-3 ■ (exon 7, reverse), 5 1 -GGACCTGATTTCCTTAC-TG- 
3' (exon 8, forward), and 5 ' -TGAATCTGAGGCATAACTG-3 1 (exon 
8, reverse) . The 45 1 PCR reaction consisted of 180 ng 
genomic DNA, 1.5 nmol dNTP, 4.5 1 10X PCR buffer, 15 mM 
MgCl 2 , 4 pmol of forward and reverse primers for exons 5 
and 7, 6 pmol of forward and reverse primers for exon 8, 
and 1.0 U of JumpStart RedAccuTaq DNA Polymerase. After 
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a 5 min 96 °C hot start, the touchdown PCR program was 
performed with 10 cycles of 96 °C (30 sec), 67 °C to 57 °c 
(-1.0 °C per cycle, 30 sec) and 72 °C (30 sec), an 
additional 30 cycles of 96 °C (30 sec), 57 °c (30 sec) and 
72 °C (30 sec), and a final extension at 72 °C for 7 min. 
The 30 SBE primers (Table 3) were designed to yield 
extension products with a sufficient mass difference and 
to be extended simultaneously in a single tube. Primer 
sequences were designed to avoid any overlap in mass, and 
the formation of secondary structures. To evenly 

separate the masses of such a large number of primers for 
SBE, some primers were synthesized using methyl-dC and dU 
phosphoramidites (Glen Research) to replace dC and dT 
respectively. Substitution of dC by methyl-dC increased 
the primer mass by 14 Da whereas a change from dT to da 
decreased the mass by 14 Da. Primers were synthesized 
using an Applied Biosystems DNA synthesizer. The 
procedures for the S3E, solid phase purification and 
MALDI-TOF MS measurement were performed as described (Kim 
et al., Analytical Biochemistry 2003, 316, 251). Direct 
DNA sequencing was conducted using energy transfer 
terminator chemistry and a MegaBACE 1000 capillary DNA 
sequencer (Amersham Bioscience) . 

D. Discussion 

Thirty polymorphic sites, including the most frequently 
mutated p53 codons, were chosen to explore the high 
multiplexing scope of the SPC-S3E method (Figure 1). 
Thirty primers specific to each polymorphic site were 
designed to yield SBE products with sufficient mass 
differences. This was achieved by tuning the mass of 
some primers using methyl-dC and dU to replace dC and dT, 
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respectively. Human genomic DNA was amplified by 

multiplex PCR to produce amplicons of three p53 exons . 
The 30 primers were mixed with the PCR products and 
biotinylated dideoxynucleotides for SBE to generate 3'- 
biotinylated extension DNA products. These products were 
then captured by streptavidin-coated solid phase magnetic 
beads, while the unextended primers and other components 
in the reaction were washed away. The pure DNA products 
were subsequently released from the solid phase and 
analyzed by MALDI-TOF MS. The nucleotide at the 

polymorphic site is accurately identified by the mass of 
the DNA extension product in a mass spectrum. Since only 
the DNA extension products are isolated for MS analysis, 
the resulting mass spectrum is free of non-extended 
primer peaks and their associated dimers, increasing 
accuracy and scope of multiplexing. The solid phase 
purification also facilitates desalting of the captured 
DNA, a process that is critical for accurate mass 
measurement by MALDI-TOF MS. 

The SPC-SBE genotyping approach was used to analyze 
nucleotide variations in 30 codons of 3 exons of the p53 
gene from 30 Wilms' tumors, 19 head and neck squamous 
carcinomas and 3 colorectal carcinomas. Primer sequences 
are shown in Table 3 along with the masses of the primers 
and their extension products. Extension products of all 
30 primers were resolved in the mass spectrum, free from 
any unextended primers, yielding digital data to 
unambiguously determine each nucleotide variation 
(Figures 13A-13C) . Unextended primers occupy the mass 
range in the mass spectrum decreasing the scope of 
multiplexing, and excess primers can dimerize to form 
false peaks in the mass spectrum (21). The excess primers 
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and their associated dimers also compete for the ion 
current, reducing the detection sensitivity of MS for the 
desired DNA fragments. These complications were 

completely removed in the SPC-SBE method. when using 
conventional ddNTPs, the mass difference between ddATP 
and ddTTP is 9 Da, which is difficult to resolve by 
MALDI-TOF MS (15). In the SPC-SBE method using 

biotinylated ddNTPs, the difference between A and T is 
increased to 66 Da, which fosters accurate detection of 
heterozygous genotypes. 



None of the 30 Wilms' tumor samples showed somatic 
mutations for the 30 polymorphic sites tested, yielding 
30 distinct peaks corresponding to the wild type p53 
sequences in a mass spectrum (Figure 13A) . in contrast, 
two of the 19 head and neck tumor samples contained a 
genetic variation; one at codon 157 (G/T heterozygous 
configuration; primary tumor biopsy; Figure 13B) and the 
other at codon 151 (C to T homozygous; squamous carcinoma 
cell line; Figure 14). in the three colorectal tumor 
cell lines tested, one (HCT-116) had 30 wild type P 53 
sequences for the 30 sites, yielding a mass spectrum 
similar to the one shown in Figure 13A, while the other 
two (HT-2 9 and SW-4 80) had a G to A homozygous mutation 
in codon 273 (Figure 13C) . Both heterozygous and 
homozygous genotypes were clearly detected in the 30 
codons with great accuracy. The G/T heterozygote 

(4684/4734 Da) was shown with two peaks corresponding to 
the wild type and mutant alleles, respectively (Figure 
13B). These data, confirmed by direct DNA sequencing, 
are consistent with the known paucity of the p53 
mutations in Wilms' tumor, and the known occurrence of 
such mutations in squamous carcinomas and colorectal 
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carcinoraas. 

It has been reported that MALDI-TOF MS could detect DNA 
sequencing fragments up to 100 bp with sufficient 
resolution using cleavable primers (29). The mass 
difference between each adjacent DNA sequencing fragment 
is approximately 300 Da. In principle, with a mass 
difference of 100 Da for each primer in designing a 
multiplex SNP analysis project using the SPC-SBE method, 
at least 300 SNPs can be analyzed in a single spot of an 
MS sample plate. Thus, each MS sample plate with 384 
spots can produce over 100,000 SNPs, which is roughly the 
number of tag SNPs required to identify all the 
haplotypes in the human genome. This level of 

multiplexing should be achievable by mass tuning the 
primers with nucleotide analogues containing stable 
chemical groups (28). It is anticipated that the SPC-SBE 
high-throughput digital SN? detection approach will have 
wide applications in screening a repertoire of genotypes 
in candidate genes as potential markers for cancer and 
other diseases. 
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Table 3. Thirty P 53 codons and the corresponding 30 SBE primers 
The position of the nucleotide variation tested in each codon is 
shown in bold. The primer sequence and modification is specified 
and the modified nucleotides are shown in bold. The mass of each 
primer is indicated along with the mass of all four possible SBE 
products. The mass values in bold specify the wild type nucleotide 
sequences (ddNTP-B = Biotinylated dideoxynucleotides) 



Primer 
Number 



Codon 



sequences (5 -3 ) 



1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19- 
20 
21 
22 
23 
24 
25 
26 
27 
28 
29 
30 



"5" 

5 
5 
5 
5 
7 
5 
8 
8 
5 
7 

a 
a 

7 
5 
5 
8 
7 
7 
7 
8 

a 

5 
7 
7 
7 
7 

a 

5 
5 



" 179 (CAT) 

157 (GTC) 
179 (CAT) 
163 (IAC) 

158 (CGC) 

248 (CGti) 
132 (AAG) 
298 (GAG) 

285 (GAG) 
161 (GCC) 

249 (AGG) 
266 (GGA) 

286 (GAA) 
258 (GAA) 
178 (IGU) 
152 (CCG) 
273 (CGI) 
234 (TAC) 

248 (CGG) 

249 (AGG) 
282 (CGG) 
278 (CCT) 
135 (IGU) 
245 (GGC) 
237 (A IG) 
242 (TGC) 
241 (TCC) 
275 (7GT) 
141 (TGC) 
175 (CGC) 



(±Cti&Mt££tAd 

GCCCGGCACCCGC 
GCGCTGCCCCCACC 
CtiCCATGGCCATCT 
CCGGCACCCGCGTCC 
TGGGCGGCATGAACC 
TCCC CTGCCCTCAACA 

AGGGGAGCCTCACCAC 
GACJAGA CC CiUC GCAC A 

CCCGCGTCCGCGCCATG 
GGCGGCATGAACCGGAG 
GTAGTGGTAA TCTACTGG 
AGAGA CC GGCGCAC AGAG 
C C TCACCATC ATC AC ACTG 
ACGCiAGGT TUTGAGGCGCT 
GTGGG7TCATTCCACACCCC 
ACGGAACAGCTTTGAGGTGC 
CTGACTGTACCACCATCCACT 
TCCTGCATGGGCGGCATGAAC 
GCATGGGCGGCATGAACCGGA 
TTGTGCC TGTCC TGGGAGAGAC 
TGAGGTGCGTGTTTGTGCCTGT 
CCCTGCCCTCAACAAGATGTTTT 
TGTC TAACAGTTCCTGCATGGGC 
TACCACCATCCACTACAACTACAT 
ACAAC TACATGTGTAACAGTTCCT 
AC TAC AAC TAC ATGTGTAAC AGTT 
GG AACAG CTTTGAGGTGCGTGTTT 
ATGTTTTGCCAACTGGCCAAGACCT 
CAGCACATGACGGAGGTTGTGAGGC 



None 
methyl C 

None 
methyl C 

None 

None 
methyl C 



Primer Mass of I 
^ddATP-B 



Mass 



methyl C 

None 
methyl C 

dU 
methyl C 
methyl C 
dU 
dU 
None 
None 
dU 
None 
dU 
None 
None 
dU 
None 
dU 
methyl C 
methyl C 
None 
None 



~38S 
3980 
4146 
4270 
4475 
4618 
4736 
4876 
4995 
5108 
5341 
5466 
5638 
5765 
5897 
6041 
6182 
6286 
6405 
6521 
6698 
6819 
6935 
7043 
7170 
7282 
7390 
7497 
7617 
7772 



0* 



454T 
4668 
4834 
49S8 

5163 

5306 

5424 

5564 

5683 

5796 

6029 

6174 

6326 

6453 

6585 

6729 

6870 

6974 

7093 

7209 

7386 

7507 

7623 

7731 

7858 

7970 

8078 

8185 

8305 

8460 



"455T 
4645 
4811 
4935 
5140 
5283 
5401 
5541 
5660 
5773 
6008 
6151 
6303 
6430 
6562 
6706 
6847 
6951 
7070 
7186 
7363 
7484 
7600 
7708 
7835 
7947 
8055 
8162 
8282 
8437 



"-436T 
4684 
4850 
4974 
5179 
5322 
5440 
5580 
5699 
5812 



8190 
6342 
6469 
6601 

6745 

6886 

6990 

7109 

7225 

7402 

7523 

7639 

7747 

7874 

7986 

8094 

8201 

8321 

8476 



"ddUTft-B 
4611 
4734 
4900 
5024 
5229 
5372 
5490 
5630 
5749 
5662 
6095 
6240 
6392 
6519 
6651 
6795 
6936 
7040 
7159 
7275 
7452 
7573 
7689 
7797 
7924 
8036 
8144 
8251 
8371 
8526 
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1. A method for determining the identity of a 
nucleotide present at a predetermined site in a DNA 
whose sequence immediately 3' of such predetermined 
site is known which comprises: 

(a) treating the DNA with an oligonucleotide primer 
whose sequence is complementary to such known 
sequence so that the oligonucleotide primer 
hybridizes to the DNA and forms a complex in 
which the 3' end of the oligonucleotide primer 
is located immediately adjacent to the 
predetermined site in. the DNA; 

(b) simultaneously contacting the complex from step 
(a) with four different labeled 
dideoxynucleotides, in the presence of a 
polymerase under conditions permitting a 
labeled dideoxynucleotide to be added to the 3' 
end of the primer so as to generate a labeled 
single base extended primer, wherein each of 
the four different labeled dideoxynucleotides 
(i) is complementary to one of the four 
nucleotides present in the DNA and <ii) has a 
molecular weight which can be distinguished 
from the molecular weight of the other three 
labeled dideoxynucleotides using mass 
spectrometry; and 

(c) determining the difference in molecular weight 
between the labeled single base extended primer 
and the oligonucleotide primer so as to 
identify the dideoxynucleotide incorporated 
into the single base extended primer and 
thereby determine the identity of the 
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nucleotide present at the predetermined site in 
the DNA . 



The method of claim 1, wherein each of the four 
labeled dideoxynucleotides comprises a chemical 
moiety attached to the dideoxynucleotide by a 
different linker which has a molecular weight 
different from that of each other linker. 

The method of claim 1 which further comprises after 
step (b) the steps of: 

(i) contacting the labeled single base extended 
primer with a surface coated with a compound 
that specifically interacts with a chemical 
moiety attached to the dideoxynucleotide by a 
linker so as to thereby capture the extended 
primer on the surface; and 

(ii) treating the labeled single base extended 
primer so as to release it from the surface. 

The method of claim 3 which further comprises after 
step (i) the step of treating the surface to remove 
primers that have not been extended by a labeled 
dideoxynucleotide . 

The method of claim 1, wherein step (c) comprises 
determining the difference in mass between the 
labeled single base extended primer and an internal 
mass calibration standard added to the extended 
primer . 



6. 



The method of claim 3, wherein the interaction 
between the chemical moiety attached to the 
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dideoxynucleotide by the linker and the compound on 
the surface comprises a biotin-st reptavidin 
interaction, a phenylboronic acid-sal icylhydroxamic 
acid interaction, or an antigen-antibody 
interaction . 



7. The method of claim 3, wherein the step of releasing 
the labeled single base extended primer from the 
surface comprises disrupting the interaction between 
the chemical moiety attached to the 
dideoxynucleotide by the linker and the compound on 
the surface. 

8. The method of claim 7, wherein the interaction is 
disrupted by a means selected from the group 
consisting of one or more of a physical means, a 
chemical means, a physical chemical means, heat, and 
light. 

9. The method of claim 2, wherein the linker is 
attached to the dideoxynucleotide at the 5-position 
of cytosine or thymine or at the 7-position of 
adenine or guanine. 

10. The method of claim 3, wherein the step of releasing 
the labeled single base extended primer from the 
surface comprises cleaving the linker between the 
chemical moiety and the dideoxynucleotide. 

11. The method of claim 10, where the linker is cleaved 
by a means selected from the group consisting of one 
or more of a physical means, a chemical means, a 
physical chemical means, heat, and light. 
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12. The method of claim 11, wherein the linker is 
cleaved by light. 



13. The method of claim 2, wherein the linker comprises 
a derivative of 4-aminome thyl benzoic acid, a 2- 
nitrobenzyl group, or a derivative of a 2- 
nitrobenzyl group. 



14. 



The method of claim 13 f wherein the linker compris 
one or more fluorine atoms. 
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15. The method of claim- 14, wherein the linker is 
selected from the group consisting of: 




CH 2 NHC(0)CF 3 
» 



(T - 




CH 2 NHC(0)CF 3 



and 




F T f 

CH 2 NHC(0)CF 3 



16. The method of claim 3, wherein the chemical moiety 
comprises biotin, the labeled dideoxynucleotide is a 
biotinylated dideoxynucleotide, the labeled single 
base extended primer is a biotinylated single base 
extended primer, and the surface is a streptavidin- 
• coated solid surface. 



17. The method of claim 16, wherein the biotinylated 
dideoxynucleotide is selected from the group 
consisting of ddATP- 1 1-biotin , ddCTP-1 1-biotin, 
ddGTP-ll-biotin, and ddTTP-16-biotin . 
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18. The method of claim 16, wherein the biotiny lated 

dideoxynucleotide is selected from the group 
consisting of: 




wherein ddNTPl, ddNTP2, ddNTP3, and ddNTP4 represent 
four different dideoxynucleotides . 
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20. The method of claim 16, wherein the biotinyiated 

dideoxynucleotide is selected from the group consisting 
of : 



H 

ddNTP' ^ 



O 



0 2 N— # 




c=/ H 



H f= 




X, 



O2N 



ddNTP3 





ddNTP<T v lC\-2 H "Ny/ 



H 



HN^NH 



O 

and 



0 2 N 



O 



wherein ddNTPl, ddNTP2, ddNTP3, and ddNTP4 represent 
four different dideoxynucleotides . 
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21. The method of claim 20, wherein the biotinylated 

dideoxynucleotide is selected from the group consisting 
of: 
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22. The method of claim 16, wherein the streptavidin- 
coated solid surface is a streptavidin-coated 
magnetic bead or a streptavidin-coated silica glass. 

23. The method of claim 1, wherein steps (a) and (b) are 
performed in a single container or in a plurality of 
connected containers . 



24. A method for determining the identity of nucleotides 
present at a plurality of predetermined sites, which 
comprises carrying out the method of claim 3 using a 
plurality of different primers each having a 
molecular weight different from that of each other 
primer, wherein a different primer hybridizes 
adjacent to a different predetermined site. 

25. The method of claim 24, wherein different linkers 
each having a molecular weight different from that 
of each other linker are attached to the different 
dideoxynucleotides to increase mass separation 
between different labeled single base extended 
primers and thereby increase mass spectrometry 
resolution . 
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